Consistency of anchor-based spectral clustering
نویسندگان
چکیده
Abstract Anchor-based techniques reduce the computational complexity of spectral clustering algorithms. Although empirical tests have shown promising results, there is currently a lack theoretical support for anchoring approach. We define specific anchor-based algorithm and show that it amenable to rigorous analysis, as well being effective in practice. establish consistency method an asymptotic setting where data sampled from underlying continuous probability distribution. In particular, we provide sharp conditions number nearest neighbors algorithm, which ensure can recover with high disjoint clusters are mutually separated by positive distance. illustrate performance on synthetic explain how convergence analysis be used inform practical choice parameter scalings. also test accuracy efficiency two large scale real sets. find offers clear advantages over standard clustering. competitive state-of-the-art LSC Chen Cai (Twenty-Fifth AAAI Conference Artificial Intelligence, 2011), while having added benefit guarantee.
منابع مشابه
Consistency of Spectral Clustering
Consistency is a key property of all statistical procedures analyzing randomly sampled data. Surprisingly, despite decades of work, little is known about consistency of most clustering algorithms. In this paper we investigate consistency of the popular family of spectral clustering algorithms, which clusters the data with the help of eigenvectors of graph Laplacian matrices. We develop new meth...
متن کاملA variational approach to the consistency of spectral clustering
This paper establishes the consistency of spectral approaches to data clustering. We consider clustering of point clouds obtained as samples of a ground-truth measure. A graph representing the point cloud is obtained by assigning weights to edges based on the distance between the points they connect. We investigate the spectral convergence of both unnormalized and normalized graph Laplacians to...
متن کاملRobust path-based spectral clustering
Spectral clustering and path-based clustering are two recently developed clustering approaches that have delivered impressive results in a number of challenging clustering tasks. However, they are not robust enough against noise and outliers in the data. In this paper, based on M-estimation from robust statistics, we develop a robust path-based spectral clustering method by defining a robust pa...
متن کاملSACOC: A Spectral-Based ACO Clustering Algorithm
The application of ACO-based algorithms in data mining is growing over the last few years and several supervised and unsupervised learning algorithms have been developed using this bio-inspired approach. Most recent works concerning unsupervised learning have been focused on clustering, where ACO-based techniques have showed a great potential. At the same time, new clustering techniques that se...
متن کاملSpectral Clustering Based on Local PCA
We propose a spectral clustering method based on local principal components analysis (PCA). After performing local PCA in selected neighborhoods, the algorithm builds a nearest neighbor graph weighted according to a discrepancy between the principal subspaces in the neighborhoods, and then applies spectral clustering. As opposed to standard spectral methods based solely on pairwise distances be...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information and Inference: A Journal of the IMA
سال: 2021
ISSN: ['2049-8772', '2049-8764']
DOI: https://doi.org/10.1093/imaiai/iaab023